What's News, What's Not? Associating News Videos with Words
Abstract
Text retrieval from broadcast news video is unsatisfactory because a transcript word frequently does not directly 'describe' the shot during which it was spoken. Extending the retrieved region to a window around the matching keyword improves recall but yields low precision. We improve on text retrieval with the following approach. First, we segment the visual stream into coherent story-like units using a set of visual news story delimiters. After filtering out clearly irrelevant classes of shots, we are still left with ambiguity about how words in the transcript relate to the visual content in the remaining shots of the story. Using a limited set of visual features at different semantic levels, ranging from color histograms to faces, cars, and outdoor scenes, we build an association matrix that captures the correlation of these visual features with specific transcript words. This matrix is then refined using an expectation-maximization (EM) approach. Preliminary results show that this approach has the potential to significantly improve retrieval performance for text queries.
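The abstract does not spell out how the association matrix is refined, so the following is only a minimal sketch of one plausible reading: an IBM Model 1 style EM loop borrowed from statistical machine translation, applied to stories represented as a set of detected visual features plus the transcript words spoken in that story. The function name `em_association`, the story representation, and the toy feature and word labels are all assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict


def em_association(stories, iterations=20):
    """Estimate t[word][feature] ~ P(word | feature) with EM.

    stories: list of (visual_features, words) pairs, one per story.
    This is an illustrative IBM Model 1 style estimator, not the
    authors' actual formulation.
    """
    features = sorted({f for fs, _ in stories for f in fs})
    words = sorted({w for _, ws in stories for w in ws})

    # Start from a uniform association matrix.
    t = {w: {f: 1.0 / len(words) for f in features} for w in words}

    for _ in range(iterations):
        count = defaultdict(float)  # expected (word, feature) counts
        total = defaultdict(float)  # expected counts per feature

        # E-step: share each word occurrence among the features that
        # co-occur with it in the same story, in proportion to t.
        for fs, ws in stories:
            for w in ws:
                norm = sum(t[w][f] for f in fs)
                for f in fs:
                    share = t[w][f] / norm
                    count[(w, f)] += share
                    total[f] += share

        # M-step: renormalise expected counts per feature.
        for w in words:
            for f in features:
                t[w][f] = count[(w, f)] / total[f] if total[f] else 0.0

    return t


# Toy data: three "stories" with hypothetical feature and word labels.
stories = [
    (["face", "indoor"], ["president", "speech"]),
    (["car", "outdoor"], ["traffic", "road"]),
    (["face", "outdoor"], ["president", "road"]),
]
t = em_association(stories)
best_for_face = max(t, key=lambda w: t[w]["face"])
print(best_for_face)
```

In this toy setup, each iteration shifts probability mass toward word and feature pairs that co-occur consistently across stories, which is the kind of disambiguation the abstract attributes to the refinement step. A real system would need the story segmentation and shot filtering described above before any such estimation.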
Similar resources
Influenza news from the frontline: what's happening?
Chief Editor Anita Simonds explains what's happening on the frontline of the flu season http://ow.ly/fBW630iiYC5.
What's new and what's next.
Some of the latest developments in HIV treatment are described and contact information is given for people seeking further information on the topic. Clinical studies and preliminary results for interleukin-2 (IL-2), 1592U89 (abacavir), DMP-266 (efavirenz or Sustiva), and 141W94 (VX-94, Vertex) are described. Pharmaceutical companies are expected to continue developing new HIV drugs, including t...
Recognizing Objects and Scenes in News Videos
We propose a new approach to recognize objects and scenes in news videos motivated by the availability of large video collections. This approach considers the recognition problem as the translation of visual elements to words. The correspondences between visual elements and words are learned using the methods adapted from statistical machine translation and used to predict words for particular ...
Identifying Important People in Broadcast News Videos
Automatic face identification in multimedia archives such as broadcast news videos is useful for indexing or retrieving documents based on important persons that appear in the video. In this paper, we propose a system which automatically detects a list of important targets such as anchor speakers or active politicians in broadcast news videos. This involves several steps including detecting fac...
What's new, what's next?
There are a number of new developments in the fight against the HIV epidemic. New drugs are being developed, results are being reported from a number of clinical trials, and there are some new strategies in disease management. Currently, there are eleven approved medications for HIV. Several new drugs or studies are highlighted: adefovir dipivoxil (preveon), which is now available under an expa...